Intelligent biological systems are characterized by their embodiment in a complex environment and the intimate interplay between their nervous systems and the nonlinear mechanical properties of their bodies. This coordination, in which the dynamics of the motor system co-evolved to reduce the computational burden on the brain, is referred to as "mechanical intelligence" or "morphological computation". In this work, we seek to develop machine learning analogs of this process, in which we jointly learn the morphology of complex nonlinear elastic solids along with a deep neural network to control them. By using a specialized differentiable simulator of elastic mechanics coupled to conventional deep learning architectures, which we refer to as neuromechanical autoencoders, we are able to learn to perform morphological computation via gradient descent. Key to our approach is the use of mechanical metamaterials, cellular solids in particular, as the morphological substrate. Just as deep neural networks provide flexible, massively parametric function approximators for perceptual and control tasks, cellular solid metamaterials are promising as a rich and learnable space for approximating a variety of actuation tasks. In this work we take advantage of these complementary computational concepts to co-design materials and neural network controls to achieve nonintuitive mechanical behavior. We demonstrate in simulation how it is possible to achieve translation, rotation, and shape matching, as well as a "digital MNIST" task. We additionally manufacture and evaluate one of the designs to verify its real-world behavior.
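
Below is a minimal sketch, in JAX, of the co-design pattern this abstract describes: morphology parameters and a neural controller updated by the same gradient descent through a differentiable simulator. Everything here is an illustrative assumption rather than the paper's API; the specialized elasticity simulator and cellular-solid parameterization are replaced by a toy stand-in, a 1-D chain of springs whose per-spring stiffnesses play the role of the learnable morphology, and names such as `simulate` and `controller` are hypothetical.

```python
import jax
import jax.numpy as jnp

def simulate(log_k, f):
    """Equilibrium displacements of a 1-D spring chain with node 0 clamped.

    Toy stand-in for the paper's differentiable elasticity simulator."""
    k = jnp.exp(log_k)                                  # positive stiffnesses
    main = k + jnp.concatenate([k[1:], jnp.zeros(1)])   # stiffness-matrix diagonal
    off = -k[1:]                                        # off-diagonals
    K = jnp.diag(main) + jnp.diag(off, 1) + jnp.diag(off, -1)
    return jnp.linalg.solve(K, f)                       # solve K u = f

def controller(params, target):
    """Tiny MLP mapping a desired displacement field to actuation forces."""
    W1, b1, W2, b2 = params
    h = jnp.tanh(W1 @ target + b1)
    return W2 @ h + b2

def loss(state, targets):
    log_k, ctrl = state                                 # morphology + controller
    def shape_error(t):
        u = simulate(log_k, controller(ctrl, t))        # actuate, then simulate
        return jnp.sum((u - t) ** 2)                    # shape-matching error
    return jnp.mean(jax.vmap(shape_error)(targets))

n, hidden = 8, 32
k1, k2, k3 = jax.random.split(jax.random.PRNGKey(0), 3)
ctrl = (0.1 * jax.random.normal(k1, (hidden, n)), jnp.zeros(hidden),
        0.1 * jax.random.normal(k2, (n, hidden)), jnp.zeros(n))
state = (jnp.zeros(n), ctrl)                            # log-stiffnesses start at 0
targets = 0.5 * jax.random.normal(k3, (64, n))          # desired displacement fields

grad_fn = jax.jit(jax.grad(loss))
for _ in range(300):                                    # joint gradient descent
    g = grad_fn(state, targets)
    state = jax.tree_util.tree_map(lambda p, gp: p - 0.05 * gp, state, g)
```

In the paper the morphology is the pore geometry of a nonlinear cellular solid and the simulator is a full elasticity solver, but the optimization pattern is the same: a single `jax.grad` differentiates through simulator and controller alike, so the material and its controls are learned jointly.
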
The successes of deep learning, variational inference, and many other fields have been aided by specialized implementations of reverse-mode automatic differentiation (AD) to compute gradients of mega-dimensional objectives. The AD techniques underlying these tools were designed to compute exact gradients to numerical precision, but modern machine learning models are almost always trained with stochastic gradient descent. Why spend computation and memory on exact (minibatch) gradients only to use them for stochastic optimization? We develop a general framework and approach for randomized automatic differentiation (RAD), which allows unbiased gradient estimates to be computed with reduced memory in return for variance. We examine limitations of the general approach and argue that we must leverage problem-specific structure to realize benefits. We develop RAD techniques for a variety of simple neural network architectures, and show that for a fixed memory budget, RAD converges in fewer iterations than using a small batch size for feedforward networks, and in a similar number for recurrent networks. We also show that RAD can be applied to scientific computing, and use it to develop a low-memory stochastic gradient method for optimizing the control parameters of a linear reaction-diffusion PDE representing a fission reactor.
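
Below is a minimal sketch, in JAX, of the flavor of estimator this abstract describes; it is not the paper's algorithm. The cotangent entering a hidden layer is multiplied by a sparse random mask scaled by d/k, so a memory-aware implementation would only need to retain k of the d hidden activations for the backward pass, and the estimate stays unbiased because each mask entry has expectation one. Names such as `mask_grad` and `rad_estimate` are hypothetical.

```python
import jax
import jax.numpy as jnp

@jax.custom_vjp
def mask_grad(h, mask):
    """Identity in the forward pass; multiplies the cotangent by `mask` in reverse."""
    return h

def _fwd(h, mask):
    # In a memory-aware version, only the k nonzero entries of mask * h
    # would need to be stored for the backward pass.
    return h, mask

def _bwd(mask, g):
    return (mask * g, None)

mask_grad.defvjp(_fwd, _bwd)

def loss(params, x, y, mask):
    W1, W2 = params
    h = jnp.tanh(W1 @ x)
    h = mask_grad(h, mask)              # randomize the backward pass only
    return 0.5 * jnp.sum((W2 @ h - y) ** 2)

d, k = 64, 8
key = jax.random.PRNGKey(0)
W1 = 0.1 * jax.random.normal(key, (d, 16))
W2 = 0.1 * jax.random.normal(jax.random.fold_in(key, 1), (4, d))
params, x, y = (W1, W2), jnp.ones(16), jnp.zeros(4)

exact = jax.grad(loss)(params, x, y, jnp.ones(d))   # all-ones mask = exact AD

def rad_estimate(sample_key):
    idx = jax.random.choice(sample_key, d, (k,), replace=False)
    mask = jnp.zeros(d).at[idx].set(d / k)          # E[mask] = 1 elementwise
    return jax.grad(loss)(params, x, y, mask)

ests = jax.vmap(rad_estimate)(jax.random.split(jax.random.PRNGKey(1), 4096))
avg = jax.tree_util.tree_map(lambda g: g.mean(0), ests)
# avg approaches `exact` as the number of samples grows.
```

Averaging many such estimates recovers the exact gradient, which is the unbiasedness the abstract trades on: each sample is cheap in memory but noisy, and the variance is the price paid for the savings.
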